Should I understand my expected tokens per minute (TPM) usage before migrating workloads?
I'm considering migrating my workloads, but I'm not sure if I should first understand my expected tokens per minute (TPM) usage. Is it necessary to have this information before the migration?